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Abstract 

Let 7 „ be the standard Gaussian measure on R" and let ( Q t ) be the 
Ornstein-Ulhenbeck semigroup. Eldan and Lee recently established that 
for every non-negative function / of integral 1 and any time t the following 
tail inequality holds true: 

> r» < C« Ssgsg., W >1 
rylogr 

where Ct is a constant depending on t but not on the dimension. The 
purpose of the present paper is to simplify parts of their argument and to 
remove the (log log r) 4 factor. 


1 Introduction 


Let 7 „ be the standard Gaussian measure on R n and let ( Q t ) be the Ornstein- 
Ulhenbeck semigroup: for every test function / 

Qtf(x)= f f (e~ f x + y/l - e~ 2t y^j 7 n {dy). (1) 

J R n 

Nelson [ 6 ] established that if p > 1 and t > 0 then Q t is a contraction from 
Lp(j n ) to L q { 7 „) for some q> p, namely for 

q =l + e 2t (p-l). 


The semigroup (Qt) is said to be hypercontractive. This turns out to be equiva¬ 
lent to the logarithmic Sobolev inequality (see the classical article by Gross [3]). 
In this paper we establish a regularity property of Qtf assuming only that / is 
in L l (^ n ). 

Let / be a non-negative function satisfying 



= 1 , 


*CEREMADE (UMR CNRS 7534) Universite Paris—Dauphine. 


1 






and let t > 0. Since Q t f > 0 and 

f Qtf d^ n = f fd'y n = l, 

J R" J R" 

Markov inequality gives 

7 n {{Qtf > r}) < 

r 

for all r > 1. Now Markov inequality is only sharp for indicator functions and 
Qtf cannot be an indicator function, so it may be the case that this inequality 
can be improved. More precisely one might conjecture that for any fixed t > 0 
(or at least for t large enough) there exists a function a satisfying 

lim a{r) =0 

r—>■+OO 


and 

ln{{Qtf>r})< (2) 

for every r > 1 and for every non-negative function / of integral 1. The function 
a should be independent of the dimension n , just as the hypercontractivity result 
stated above. Such a phenomenon was actually conjectured by Talagrand in [7J 
in a slightly different context. He conjectured that the same inequality holds 
true when 7 „ is replaced by the uniform measure on the discrete cube {—1, l} n 
and the Orstein - Uhlenbeck semigroup is replaced by the semigroup associated 
to the random walk on the discrete cube. The Gaussian version of the conjecture 
would follow from Talagrand’s discrete version by the central limit theorem. In 
this paper we will only focus on the Gaussian case. 

In pp, Ball, Barthe, Bednorz, Oleszkiewicz and Wolff showed that in dimen¬ 
sion 1 the inequality J2J) holds with decay 

, X c 

a{r) = r - , 

Vlogr 

where the constant C depends on the time parameter t. Moreover the authors 
provide an example showing that the 1/yiogr decay is sharp. They also have 
a result in higher dimension but they loose a factor log log r and, more impor¬ 
tantly, their constant C then tends to +00 (actually exponentially fast) with the 
dimension. The deadlock was broken recently by Eldan and Lee who showed 
in j2] that © holds with function 

^ ^ ( lo g lo g0 4 

ot{r) = C - - , 

Vlogr 

with a constant C that is independent of the dimension. Again up to the log log 
factor the result is optimal. 

In this article we revisit the argument of Eldan and Lee. We shall simplify 
some steps of their proof and short cut some others. As a result, we are able to 
remove the extra log log factor. We would like to make clear though that this 
note does not really contain any new idea and that the core of our argument is 
all Eldan and Lee’s. 
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2 Main results 


Recall that j n is the standard Gaussian measure and that ( Q t ) is the Ornstein 
Uhlenbeck semigroup, defined by ©■ Here is our main result. 

Theorem 1. Let f be a non-negative function on R" satisfying f Rn f d"/ n = 1 
and let t > 0. Then for every r > 1 


7n {{Qtf >r})<C 


max(l,f x ) 
ry/ log r 


where C is a universal constant. 

As in Eldan and Lee’s paper, the Ornstein Ulhenbeck semigroup only plays 
a role through the following lemma. 

Lemma 2. Let /: R" —► R+. For every t > 0, we have 

V 2 log (Qtf) > — id, 


pointwise. 

Proof. This really straightforward: observe that © can be rewritten as 

Qtf(x) = {f * gi- p )(px), 

where p = e _t and g\~ p is the density of the Gaussian measure with mean 0 
and covariance (1 — p)id. Then differentiate twice and use the Cauchy-Schwarz 
inequality. Details are left to the reader. □ 

What we actually prove is the following, where Qt does not appear anymore. 

Theorem 3. Let f be a positive function on R ra satisfying f Rn f d"/ n = 1. 
Assume that f is smooth and satisfies 

V 2 log / > —/3 id (3) 


pointwise, for some /3 > 0. Then for every r > 1 


In ({/ > r-}) < 


C max(/3,1) 
r\/log r 


where C is a universal constant. 


Obviously, Theorem [3] and Lemma [2] altogether yield Theorem [TJ 
Let us comment on the optimality of Theorem |T] and Theorem [3] In dimen¬ 
sion 1, consider the function 


U(x) 


_ ^OLX — Ot 2 /2 
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Observe that f a > 0 and that J R f a d"f = 1. Note that for every t > 1 we have 

ce - * 2 / 2 

7i {[t,+o o)) > ---, 

where c is a universal constant. So if a > 0 and r > e then 


7i ({fa > r}) > 


■=» P (-i(!T + f) 3 ) 

log r a 
a ' 2 


Choosing a = v / 2Togr we get 


7i ({/a > r}) > 


Vlog r 


Since (log/ Q )" = 0 this shows that the dependence in r in Theorem [3] is sharp. 
Actually this example also shows that the dependence in r in Theorem [1] is 
sharp. Indeed, it is easily seen that 


Qtfa fae —t i 

for every a G R and t > 0. This implies that f a always belongs to the image 
Q t . Of course, this example also works in higher dimension: just replace f a by 

fu (x) = e<“^>-l“l 2 / 2 


where u belongs to R ra . 


Theorem 4. Let X be a random vector having density f with respect to the 
Gaussian measure, and assume that f satisfies ©• Then for every r > 1 


P (f(X)e(r,er])<C 


max(/3,1) 
Vlog r 


Theorem 0] easily yields Theorem [3] 


Proof of Theorem [5J Let G be standard Gaussian vector on R” and let X be a 
random vector having density / with respect to 7 „. Then using Theorem H] 

+oo 

P[/(G) > r] = £ P (/(G) e (e fc r, e fc+1 r]) 

k—0 

+oo 

< E(^)' lE [/( G ) !{ f(G)e(e*r,e*+ir}}] 
k =0 

+oo 

= J2(e k r)~ 1 P (f(X) £ (e k r,e k+1 r}) 
k—0 

t^o \/ lo g( efe? ') 

< c e 1 max(/3,1) 

— e — 1 r Vl°g r 
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which is the result. 


□ 


The rest of the note is devoted to the proof of Theorem 0] 


3 Preliminaries: the stochastic construction 

Let /i be a probability measure on R" having density / with respect to the 
Gaussian measure. We shall assume that / is bounded away from 0, that / is 
C 2 and that V/ and V 2 / are bounded. A simple density argument shows that 
we do not lose generality by adding these technical assumptions. 

Eldan and Lee’s argument is based on a stochastic construction which we 
describe now. Let ( Bt ) be a standard n-dimensional Brownian motion and let 
(Pt) be the associated semigroup: 

P t h(x ) = E [h(x + P t )], 

for all test functions h. Note that ( P t ) is the heat semigroup, not the Ornstein 
Ulhenbeck semigroup. Consider the stochastic differential equation 

(X 0 = ° 

\ dX t = dB t + V log(P 1 _ t /)(X t ) dt, t e [0,1]. 1 j 

The technical assumptions made on / insure that the map 

x ^ VlogPi_ t /(x) 

is Lipschitz, with a Lipschitz norm that does not depend on t £ [0,1]. So the 
equation Q has a strong solution ( X t ). In our previous work [4] we study the 
process (X t ) in details and we give some applications to functional inequalities. 
Let us recap here some of these properties and refer to [4[ section 2.5] for proofs. 
Recall that if fi ±, ^2 are two probability measures, the relative entropy of /ii with 
respect to ^2 is defined by 

H(mi I M 2 ) = J log d Ml) 

if Hi is absolutely continuous with respect to H 2 (and H(/zi | H 2 ) = +00 oth¬ 
erwise). Also in the sequel we call drift any process (ut) taking values in M” 
which is adapted to the natural filtration of ( B t ) (this means that Ut depends 
only on (B s ) s < t ) and satisfies 


|u t | 2 ds < + 00 . 


Let (vt) be the drift 


vt = VlogPr-t/pG). 
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Using Ito’s formula it is easily seen that 


dlog Pi-tf (Xt) = ( v t ,dB t } + ^\v t \ 2 dt. 
Therefore, for every t G [0,1] 


Pi-tf(Xt) = exp 



( v s ,dB s } + - 



( 5 ) 


Combining this with the Girsanov change of measure theorem one can show that 
the random vector Xi has law /i (again we refer to [Jj for details). Moreover we 
have the equality 


h (m | In) = 



( 6 ) 


Also, if (ut) is any drift and if u is the law of 


Bi + 



Ut dt , 


then 


H OlT n) < ^E 



(7) 


So the drift (vt) is in some sense optimal. Lastly, and this will play a crucial 
role in the sequel, the process (vt) is a martingale. 

Eldan and Lee introduce a perturbed version of the process (Xt), which we 
now describe. From now on we fix r > 1 and we let 


T = inf{t G [0,1], P±— t f (X t ) > r} A 1 

be the first time the process (Pi_ t f(Xt)) hits the value r (with the convention 
that T = 1 if it does not ever reach r). Now given S > 0 we let (Xf)) be the 
process defined by 

rTAt 

X? = X t + 5 v s ds. 

Jo 

Note that this perturbed process is still of the form Brownian motion plus drift: 


Xt — B t + f (1 + ^l{s<T})r , s ds. 
Jo 


So letting be the law of Xf and using 0 we get 


H(// I 7 ) < 77E 


= -E 

2 


f (1 + S 1{ s < T }) 2 |i; s | 2 ds 
Jo 


It’s 1 2 ds 


1 ( d 2 \ 

’ r T 

+ [<S+— E 

/ \v s \ 2 ds 

V 2; 

Jo 


( 8 ) 
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4 Proof of the main result 


The proof can be decomposed into two steps. Recall that r is fixed from the 
beginning and that Xf actually depends on r through the stopping time T. 
The first step is to prove that if <5 is small then /i and fj, s are not too different. 

Proposition 5. Assuming ([3]), we have 

<fav(l*, V s ) < <V(/3 + !) log r, 


for every S > 0, where dry denotes the total variation distance. 

The second step is to argue that /(X-f) tends to be bigger than f(Xi). An 
intuition for this property is that the difference between Xf and X- ; is somehow 
in the direction of X f(Xi). 

Proposition 6. Assuming 13]), we have 

P (f(Xf) < r 1+2S e~ 4 ) < P (f(X 1 ) <r) + (/3 + 4 )6 2 log(r), 


for all S > 0. 

Remark. Note that both propositions use the convexity hypothesis 131). 

It is now very easy to prove Theorem [4] Since Xi has law /r, all we need to 
prove is 


We choose 


P (f(X 1 )G(r,er])<C 


6 = 


max(/3,1) 
VTogr 


2 logr 


For this value of 6, Proposition [6] gives 

P(/(*i 5 ) > er) < P (fiX,) < r) + f 

4 logr 

whereas Proposition [5] yields 

P(/(Ad) < er) < P(/(Xf) < er) + dry (»,»*) 

5 f/3+l\ 


< P (fiX 4 ) < er) + - 
Combining the two inequalities we obtain 

P(/(Ai)<er)<P(/(X 1 )<r) + C 


1/2 


logr J 
max(/3,1) 


Vlogr 

which is the result. 

Remark. We actually prove the slightly stronger statement: 

/ max(/3,1) /max(/3,1) \ 


P(/(Xi) G (r,er]) < C max 


l logr ’ \ logr 
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5 Proof of the total variation estimate 


We actually bound the relative entropy of // with respect to /i. Recall that 
log / is assumed to be weakly convex: there exists /3 > 0 such that 

V 2 log / > —/3 id, (9) 


pointwise. 

Proposition 7. Assuming ©, we have 

H(// | n) < <5 2 (/3 + l)logr, 


for all S > 0 . 

This yields Proposition [5] by Pinsker’s inequality. 
Proof. Observe that 


H(/ | /x) = H(/ | 7 ) - / log(/) dfj, 5 . (10) 

jR n 

Now © gives 

log > log f(X,) + (Vlog/(Ad), Af - X x ) - || X{ - Xrf, 

f T 2 fT (11) 

> logZ(-X'i) + s / {Vi,v s )ds - — \v s \ 2 ds, 

Jo 1 Jo 

almost surely. We shall use this inequality several times in the sequel. Recall 
that A'i has law /x and that A') 5 has law fi s . Taking expectation in the previous 
inequality and using m we get 


H(m A I d) < H(/x <5 | 7 ) - H(/x | 7 ) 


-6 E 


(vi,v s ) ds 


f35 2 



Together with © and (J5J we obtain 


H(/x 5 | /x) < <5 E 



- v s ,v s ) ds 


(1 + P)S\ 
2 



Now since (vt) is a martingale and T a stopping time we have 
E [(ui,v«) 1 {s<t}] = E [ks| 2 l{s<r}] 

for all time s < 1. This shows that the first term in the previous inequality is 0. 
To bound the second term, observe that the definition of T and the equality © 
imply that 

fT i fT 

/ (v s ,dB s ) + - |^s | 2 ds < logr, 

Jo Z Jo 
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almost surely. Since (v t ) is a bounded drift, the process (J^(v s ,dB s )) is a 
martingale. Now T is a bounded stopping time, so by the optional stopping 
theorem 


E 


A 


,dB s ) 


= 0 . 


Therefore, taking expectation in the previous inequality yields 


E 



< 2 log r, 


which concludes the proof. 


□ 


6 Proof of Proposition [6] 

The goal is to prove that 

P (f(X() < r 1 + 2 <s e -4 ) < P(/(Xi) < r) + 6 2 ((3 + 4) logr. 


Obviously 

P (/( X*) < r 1 + 2 < 5 e -4 ) < P(/(Xi) < r) + P < r 1+25 e ~ 4 ; f{X x ) > r) 

Now recall the inequality m coming for the weak convexity of log / and rewrite 
it as 

log f{X() > I<i + 2 5K t + Y 
where (K t ) is the process defined by 


K t = log(Pi_ t )(/)(X t ) = [ {v s ,dB s ) + i [ |u s | 2 ds, 

Jo z Jo 

and Y is the random variable 

pT pT / > ^ n 

Y = —26 / (v s ,dB s )+6 / (vi-v s ,v s )ds -— / \v s \ 2 ds. 

Jo Jo J Jo 

Recall that the stopping time T is the first time the process ( K t ) exceeds the 
value logr if it ever does, and T = 1 otherwise. In particular, if 

K i = log/(Xi) > logr 

then Kt = logr. So if f(X i) > r then 

f{X() > r 1 + 2 S e Y . 

Therefore 

P (f(Xf) < r 1+25 e- 4 ; f(X 1 ) > r) < P (Y < -4). 
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So we are done if we can prove that 


P(Y < —4) < (0 + 4)(5 2 log r. 


( 12 ) 


There are three terms in the definition of Y. The problematic one is 


^ [ (vi 

Jo 


- v s ,v s ) ds. 


We know from the previous section that it has expectation 0. A natural way 
to get a deviation bound would be to estimate its second moment but it is not 
clear to us how to do this. Instead we make an complicated detour. 

Lemma 8. Let Z be an integrable random variable satisfying E[e z ] < 1. Then 


P(Z < -2) < —E [Z\. 


Remark. Note that E [Z\ < 0 by Jensen’s inequality. 


Proof. Simply write 

E [e z ] > E [e z l{z>-2}] 

> E [(Z + 1) l{z>_ 2 }] 

= E[Z\ — E [Z l { z<_2 } ] + 1 - P(Z < -2) 

> E[Z] + P (Z < -2) + 1. 

So if E[e z ] < 1 then P (Z < -2) < -E[Z). 

Lemma 9. Let Z be the variable 

z = -S f ( v s ,dB s )+S f (v 1 - v s ,v s ) ds - ^ f \v s \ 2 ds. 

Jo Jo * Jo 


Then 


P (Z < —2) < 5 2 {i3 + 1) log r. 


□ 


Proof. As we have seen before the first two terms in the definition of Z have 
expectation 0 and 


m = 


(£±1M! E 

2 



> —<5 2 (/3 + 1) logr. 


By Lemma[ 8 ]it is enough to show that E{e z ] < 1. To do so, we use the Girsanov 
change of measure formula. The process [Xf ] is of the form Brownian motion 
plus drift: 


x* = x t + s 



v s ds 


= B t +[ (1 + <51{ s <t}K ds, 
Jo 
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Note also that the drift term is bounded. Therefore, Girsanov’s formula applies, 
see for instance [5] chapter 6] (beware that the authors oddly use the letter M 
to denote expectation). The process (Df) defined by 

D t = ex P J Q t 1 + sl {s<T})(v s ,dB s ) ~l, J |(1 + 1{ s <t}H| 2 ds 

is a non-negative martingale of expectation 1 and under the measure Q s defined 

by 

d,Q s = D{ dP 

the process {Xf ) is a standard Brownian motion. In particular 

P[f{Xi)D{] = ElfiB,)} = 1 . 


Now we use inequality (HD once again. A tedious but elementary computation 
shows that it gives exactly 


f(Xf) Df > e z . 


Therefore E[e z ] < 1, which concludes the proof. 


□ 


We now prove inequality m- The idea being that the annoying term in Y 
is handled by the previous lemma. Observe that 


So 


f 1 5 2 f 

Y = Z — 5 / {v s ,dB s )~ — 

Jo z Jo 


1 ds. 


P (Y < -4) < P(Z < -2) + P (^6 {v s ,dB s ) > 1 


+ P 


S 2 


‘ ds > 1 


Recall that f Q ( v s ,dB s ) has mean 0 and observe that 


f f T \ 2 


’ r T 

6 / ( v s ,dB s ) 

= S 2 E 

/ |^s| 2 ds 

\ Jo J 


Jo 


< 25 2 log r. 


So by Tchebychev inequality 


P ^<5 y ( v s ,dB s ) > 1^ < 2S 2 log r. 


Similarly by Markov inequality 


(y J \v s \ 2 ds > 1^ < S' 2 logr. 


Putting everything together we get m , which concludes the proof. 


11 






References 


[1] Ball, K.; Barthe, F.; Bednorz, W.; Oleszkiewicz, K.; Wolff, P. 
L\-smoothing for the Ornstein-Uhlenbeck semigroup. Mathematika 59 
(2013), no. 1, 160-168. 

[2] Eldan, R.; Lee, J. Regularization under diffusion and anti-concentration 
of temperature. arXiv:1410.3887. 

(31 Gross, L. Loqarithmic Sobolev inequalities. Amer. J. Math. 97 (1975), 
no. 4, 1061-1083. 

[4] Lehec, J. Representation formula for the entropy and functional inequal¬ 
ities. Ann. Inst. Henri Poincare Probab. Stat. 49 (2013), no. 3, 885-899. 

[5] Liptser, R.; Shiryaev, A. Statistics of random processes. Vol I, gen¬ 
eral theory. 2nd edition. Stochastic Modelling and Applied Probability. 
Springer-Verlag, Berlin, 2001. 

[6] Nelson, E. The free Markoff field. J. Functional Analysis 12 (1973), 211 
227. 

[7] Talagrand, M. A conjecture on convolution operators, and a non- 
Dunford-Pettis operator on L\. Israel J. Math. 68 (1989), no. 1, 82-88. 


12 


